Dual Formulation of Controlled Markov Diffusions and Its Application

Authors

  • Fan Ye
  • Enlu Zhou
Abstract

Information relaxation and duality in Markov decision processes have been studied recently to derive upper bounds on the maximal expected reward (or lower bounds on the minimal expected cost). The idea is to relax the non-anticipativity constraint on the controls and impose a penalty to punish such a violation. In this paper we generalize this dual approach to controlled Markov diffusions. We develop the weak duality and strong duality results, and explore the structure of the optimal penalty. We demonstrate the use of this dual formulation by computing upper bounds on the optimal expected utility in a dynamic portfolio choice problem.
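The relax-and-penalize idea can be seen in a toy discrete-time problem. The sketch below is only an illustration and is not the paper's diffusion setting or method: the betting game, the function names `primal_value` and `dual_bound`, and both penalty functions are assumptions made up for this example. In each of T periods the controller chooses a in {0, 1} before a coin w in {+1, -1} is revealed and earns a*w. With no penalty, the hindsight (perfect-information) value is an upper bound on the optimal value. With the penalty a*(w - E[w]), which has zero mean under any non-anticipative policy, the bound becomes tight in this example.

```python
from itertools import product

def primal_value(p, T):
    # Best non-anticipative policy: each bet has mean 2p - 1, so bet only if that is positive.
    return T * max(0.0, 2 * p - 1)

def dual_bound(p, T, penalty):
    """E[max_a sum_t (a_t * w_t - penalty(a_t, w_t))], computed by enumerating all coin paths."""
    total = 0.0
    for w in product((1, -1), repeat=T):
        prob = 1.0
        for wt in w:
            prob *= p if wt == 1 else 1 - p
        # With hindsight the inner problem separates across periods,
        # because reward and penalty are additive and actions are unconstrained.
        best = sum(max(a * wt - penalty(a, wt) for a in (0, 1)) for wt in w)
        total += prob * best
    return total

p, T = 0.6, 3  # hypothetical parameters

def zero_penalty(a, w):
    # No penalty: plain perfect-information relaxation.
    return 0.0

def martingale_penalty(a, w):
    # Zero mean for any action chosen before w is seen, so it is dual feasible.
    return a * (w - (2 * p - 1))

v = primal_value(p, T)
ub_zero = dual_bound(p, T, zero_penalty)
ub_tight = dual_bound(p, T, martingale_penalty)
print(v, ub_zero, ub_tight)
```

Here the zero-penalty bound exceeds the optimal value (weak duality), while the second penalty closes the gap, which is the discrete-time analogue of the strong duality the abstract refers to.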

Similar resources

Application of Markov Processes to the Machine Delays Analysis

Production and non-productive equipment and personnel delays are a critical element of any production system. The frequency and length of delays impact heavily on the production and economic efficiency of these systems. Machining processes in wood industry are particularly vulnerable to productive and non-productive delays. Whereas, traditional manufacturing industries usually operate on homoge...

Synthesis and characterization of novel dual UV/thermal curable epoxy rosinate

Rosin has a comprehensive application in adhesives, printing inks, protective coatings, rubbers and pharmaceutical. In this work, novel dual UV/thermal curable epoxy rosinate was synthesized by esterification reaction between epoxy resin and purified rosin. This product was evaluated by FT-IR spectroscopy techniques and acid number. UV curable resin was formulated for UV curing ability by benzo...

Mapping Activity Diagram to Petri Net: Application of Markov Theory for Analyzing Non-Functional Parameters

The quality of an architectural design of a software system has a great influence on achieving non-functional requirements of a system. A regular software development project is often influenced by non-functional factors such as the customers' expectations about the performance and reliability of the software as well as the reduction of underlying risks. The evaluation of non-functional paramet...

AN APPLICATION OF TRAJECTORIES AMBIGUITY IN TWO-STATE MARKOV CHAIN

In this paper, the ambiguity of finite-state irreducible Markov chain trajectories is recalled and is obtained for the two-state Markov chain. I give an applicable example of this concept in a presidential election.

The Limit Behavior of Dual Markov Branching Processes

A dual Markov branching process (DMBP) is by definition a Siegmund’s predual of some Markov branching process (MBP). Such a process does exist and is uniquely determined by the so-called dual-branching property. Its q-matrix Q is derived and proved to be regular and monotone. Several equivalent definitions for a DMBP are given. The criteria for transience, positive recurrence, strong ergodicity,...

Publication date: 2014